# Efficient Quantization Deployment

Meta Llama 3.1 8B GGUF

A GGUF-quantized build of Meta-Llama-3.1-8B, produced with llama.cpp, for multilingual text generation.

Large Language Model, Supports Multiple Languages

Meta Llama Llama 4 Scout 17B 16E Instruct Old GGUF

Llama-4-Scout-17B-16E-Instruct is an instruction-tuned mixture-of-experts large language model released by Meta, with 17B active parameters and 16 experts. This build has been quantized to reduce memory use and speed up inference.

Large Language Model, Supports Multiple Languages

MiniCPM-o 2.6 GGUF

MiniCPM-o 2.6 is a multimodal model that supports vision and language tasks. This GGUF build is packaged for use with llama.cpp.
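
The builds listed above all ship as GGUF files, the format that llama.cpp loads. As a rough illustration of what that means in practice, the sketch below reads the fixed-size header at the start of a GGUF file using only the Python standard library. It assumes a little-endian file at GGUF version 2 or later, where the two counts are 64-bit. The function name `read_gguf_header` and the sample values are illustrative and are not part of llama.cpp.

```python
import struct

GGUF_MAGIC = b"GGUF"   # first four bytes of every GGUF file
HEADER_SIZE = 24       # 4 (magic) + 4 (version) + 8 (tensor count) + 8 (metadata count)


def read_gguf_header(data: bytes) -> dict:
    """Parse the fixed-size GGUF header.

    Assumption: little-endian layout, GGUF version >= 2.
    """
    if len(data) < HEADER_SIZE:
        raise ValueError("need at least 24 bytes for a GGUF header")
    if data[:4] != GGUF_MAGIC:
        raise ValueError("not a GGUF file: bad magic bytes")
    # uint32 version, uint64 tensor count, uint64 metadata key/value count
    version, tensor_count, kv_count = struct.unpack_from("<IQQ", data, 4)
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }


if __name__ == "__main__":
    # Synthetic header for demonstration only; the counts are made up.
    sample = GGUF_MAGIC + struct.pack("<IQQ", 3, 291, 25)
    print(read_gguf_header(sample))
```

To inspect a real download, open the file in binary mode and pass its first 24 bytes to `read_gguf_header`. A bad magic value is a quick sign that the file is truncated or is not a GGUF build at all.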
